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Abstract 

The problem of computing functions of values at the nodes in a network in a totally distributed manner, 
where nodes do not have unique identities and make decisions based only on local information, has applications 
in sensor, peer-to-peer, and ad-hoc networks. The task of computing separable functions, which can be written as 
linear combinations of functions of individual variables, is studied in this context. Known iterative algorithms for 
fN| , averaging can be used to compute the normalized values of such functions, but these algorithms do not extend in 

5_i ■ general to the computation of the actual values of separable functions. 

P-i. The main contribution of this paper is the design of a distributed randomized algorithm for computing separable 

■ functions. The running time of the algorithm is shown to depend on the running time of a minimum computation 
\ algorithm used as a subroutine. Using a randomized gossip mechanism for minimum computation as the subroutine 

■ yields a complete totally distributed algorithm for computing separable functions. For a class of graphs with small 
spectral gap, such as grid graphs, the time used by the algorithm to compute averages is of a smaller order than the 

'■ time required by a known iterative averaging scheme. 

I. Introduction 

The development of sensor, peer-to-peer, and ad hoc wireless networks has stimulated interest in distributed 
. algorithms for data aggregation, in which nodes in a network compute a function of local values at the individual 
I> | nodes. These networks typically do not have centralized agents that organize the computation and communication 
^ ■ among the nodes. Furthermore, the nodes in such a network may not know the complete topology of the network, and 
q , the topology may change over time as nodes are added and other nodes fail. In light of the preceding considerations, 
"st" I distributed computation is of vital importance in these modern networks. 

We consider the problem of computing separable functions in a distributed fashion in this paper. A separable 
q . function can be expressed as the sum of the values of individual functions. Given a network in which each node 
""^5 | has a number, we seek a distributed protocol for computing the value of a separable function of the numbers at the 
O ■ nodes. Each node has its own estimate of the value of the function, which evolves as the protocol proceeds. Our 
I> . goal is to minimize the amount of time required for all of these estimates to be close to the actual function value. 

In this work, we are interested in totally distributed computations, in which nodes have a local view of the state 
j_j ■ of the network. Specifically, an individual node does not have information about nodes in the network other than 



its neighbors. To accurately estimate the value of a separable function that depends on the numbers at all of the 
nodes, each node must obtain information about the other nodes in the network. This is accomplished through 
communication between neighbors in the network. Over the course of the protocol, the global state of the network 
effectively diffuses to each individual node via local communication among neighbors. 

More concretely, we assume that each node in the network knows only its neighbors in the network topology, 
and can contact any neighbor to initiate a communication. On the other hand, we assume that the nodes do not have 
unique identities (i.e., a node has no unique identifier that can be attached to its messages to identify the source of 
the messages). This constraint is natural in ad-hoc and mobile networks, where there is a lack of infrastructure (such 
as IP addresses or static GPS locations), and it limits the ability of a distributed algorithm to recreate the topology 
of the network at each node. In this sense, the constraint also provides a formal way to distinguish distributed 
algorithms that are truly local from algorithms that operate by gathering enormous amounts of global information 
at all the nodes. 

The absence of identifiers for nodes makes it difficult, without global coordination, to simply transmit every 
node's value throughout the network so that each node can identify the values at all the nodes. As such, we develop 
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an algorithm for computing separable functions that relies on an order- and duplicate-insensitive statistic [20] of a 
set of numbers, the minimum. The algorithm is based on properties of exponential random variables, and reduces 
the problem of computing the value of a separable function to the problem of determining the minimum of a 
collection of numbers, one for each node. 

This reduction leads us to study the problem of information spreading or information dissemination in a network. 
In this problem, each node starts with a message, and the nodes must spread the messages throughout the network 
using local communication so that every node eventually has every message. Because the minimum of a collection 
of numbers is not affected by the order in which the numbers appear, nor by the presence of duplicates of an 
individual number, the minimum computation required by our algorithm for computing separable functions can be 
performed by any information spreading algorithm. Our analysis of the algorithm for computing separable functions 
establishes an upper bound on its running time in terms of the running time of the information spreading algorithm 
it uses as a subroutine. 

In view of our goal of distributed computation, we analyze a gossip algorithm for information spreading. Gossip 
algorithms are a useful tool for achieving fault-tolerant and scalable distributed computations in large networks. 
In a gossip algorithm, each node repeatedly iniatiates communication with a small number of neighbors in the 
network, and exchanges information with those neighbors. 

The gossip algorithm for information spreading that we study is randomized, with the communication partner of 
a node at any time determined by a simple probabilistic choice. We provide an upper bound on the running time 
of the algorithm in terms of the conductance of a stochastic matrix that governs how nodes choose communication 
partners. By using the gossip algorithm to compute minima in the algorithm for computing separable functions, we 
obtain an algorithm for computing separable functions whose performance on certain graphs compares favorably 
with that of known iterative distributed algorithms [5] for computing averages in a network. 

A. Related work 

In this section, we present a brief summary of related work. Algorithms for computing the number of distinct 
elements in a multiset or data stream [11], [2] can be adapted to compute separable functions using information 
spreading [7]. We are not aware, however, of a previous analysis of the amount of time required for these algorithms 
to achieve a certain accuracy in the estimates of the function value when the computation is totally distributed (i.e., 
when nodes do not have unique identities). These adapted algorithms require the nodes in the network to make use 
of a common hash function. In addition, the discreteness of the counting problem makes the resulting algorithms for 
computing separable functions suitable only for functions in which the terms in the sum are integers. Our algorithm 
is simpler than these algorithms, and can compute functions with non-integer terms. 

There has been a lot of work on the distributed computation of averages, a special case of the problem of 
reaching agreement or consensus among processors via a distributed computation. Distributed algorithms for 
reaching consensus under appropriate conditions have been known since the classical work of Tsitsiklis [25] and 
Tsitsiklis, Bertsekas, and Athans [26] (see also the book by Bertsekas and Tsitsiklis [4]). Averaging algorithms 
compute the ratio of the sum of the input numbers to n, the number of nodes in the network, and not the exact 
value of the sum. Thus, such algorithms cannot be extended in general to compute arbitrary separable functions. 
On the other hand, an algorithm for computing separable functions can be used to compute averages by separately 
computing the sum of the input numbers, and the number of nodes in the graph (using one as the input at each 
node). 

Recently, Kempe, Dobra, and Gehrke showed the existence of a randomized iterative gossip algorithm for 
averaging with the optimal averaging time [15]. This result was restricted to complete graphs. The algorithm 
requires that the nodes begin the computation in an asymmetric initial state in order to compute separable functions, a 
requirement that may not be convenient for large networks that do not have centralized agents for global coordination. 
Furthermore, the algorithm suffers from the possibility of oscillation throughout its execution. 

In a more recent paper, Boyd, Ghosh, Prabhakar, and Shah presented a simpler iterative gossip algorithm for 
averaging that addresses some of the limitations of the Kempe et al. algorithm [5]. Specifically, the algorithm and 
analysis are applicable to arbitrary graph topologies. Boyd et al. showed a connection between the averaging time 
of the algorithm and the mixing time (a property that is related to the conductance, but is not the same) of an 
appropriate random walk on the graph representing the network. They also found an optimal averaging algorithm 
as a solution to a semi-definite program. 
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For completeness, we contrast our results for the problem of averaging with known results. As we shall see, 
iterative averaging, which has been a common approach in the previous work, is an order slower than our algorithm 
for many graphs, including ring and grid graphs. In this sense, our algorithm is quite different than (and has 
advantages in comparison with) the known averaging algorithms. 

On the topic of information spreading, gossip algorithms for disseminating a message to all nodes in a complete 
graph in which communication partners are chosen uniformly at random have been studied for some time [12], [21], 
[9]. Karp, Schindelhauer, Shenker, and Vocking presented a push and pull gossip algorithm, in which communicating 
nodes both send and receive messages, that disseminates a message to all n nodes in a graph in O(logra) time 
with high probability [14]. In this work, we have provided an analysis of the time required for a gossip algorithm 
to disseminate n messages to n nodes for the more general setting of arbitrary graphs and non-uniform random 
choices of communication partners. For other related results, we refer the reader to [22], [16], [17]. We take note 
of the similar (independent) recent work of Ganesh, Massoulie, and Towsley [13], and Berger, Borgs, Chayes, and 
Saberi [3], on the spread of epidemics in a network. 

B. Organization 

The rest of the paper is organized as follows. Section [II] presents the distributed computation problems we study 
and an overview of our results. In Section [Till we develop and analyze an algorithm for computing separable 
functions in a distributed manner. Section [TV] contains an analysis of a simple randomized gossip algorithm for 
information spreading, which can be used as a subroutine in the algorithm for computing separable functions. In 
Section IVl we discuss applications of our results to particular types of graphs, and compare our results to previous 
results for computing averages. Finally, we present conclusions and future directions in Section [VI] 



We consider an arbitrary connected network, represented by an undirected graph G = (V,E), with \V\ = n 
nodes. For notational purposes, we assume that the nodes in V are numbered arbitrarily so that V = {1, . . . , n}. 
A node, however, does not have a unique identity that can be used in a computation. Two nodes i and j can 
communicate with each other if (and only if) E E. 

To capture some of the resource constraints in the networks in which we are interested, we impose a transmitter 
gossip constraint on node communication. Each node is allowed to contact at most one other node at a given time 
for communication. However, a node can be contacted by multiple nodes simultaneously. 

Let 2 V denote the power set of the vertex set V (the set of all subsets of V). For an n-dimensional vector 
x E R™, let xi, . . . , x n be the components of x. 



Definition 1: We say that a function / : R™ x 2 V —* R is separable if there exist functions f\,...,f n such that, 
for all S C V, 



Goal. Let T be the class of separable functions / for which fi(x) > 1 for all x E R and i = 1, . . . ,n. Given 
a function / E T, and a vector x containing initial values X{ for all the nodes, the nodes in the network are to 
compute the value f(x,V) by a distributed computation, using repeated communication between nodes. 

Note 1: Consider a function g for which there exist functions g\, . . . , g n satisfying, for all S C V, the condition 
g(x, S) = IlieS 9i( x i) i n ueu °f ©■ Then, g is logarithmic separable, i.e., / = log fe g is separable. Our algorithm 
for computing separable functions can be used to compute the function / = \og b g. The condition fi(x) > 1 
corresponds to gi{x) > b in this case. This lower bound of 1 on fi(x) is arbitrary, although our algorithm does 
require the terms fi(xi) in the sum to be positive. 

Before proceeding further, we list some practical situations where the distributed computation of separable 
functions arises naturally. By definition, the sum of a set of numbers is a separable function. 

(1) Summation. Let the value at each node be xi = 1. Then, the sum of the values is the number of nodes in the 



(2) Averaging. According to Definition [TJ the average of a set of numbers is not a separable function. However, 
the nodes can estimate the separable function Y17=i x i anc ^ n separately, and use the ratio between these two 
estimates as an estimate of the mean of the numbers. 



II. Preliminaries and Results 




(1) 



network. 
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Suppose the values at the nodes are measurements of a quantity of interest. Then, the average provides an 
unbiased maximum likelihood estimate of the measured quantity. For example, if the nodes are temperature 
sensors, then the average of the sensed values at the nodes gives a good estimate of the ambient temperature. 
For more sophisticated applications of a distributed averaging algorithm, we refer the reader to [18] and [19]. 

Averaging is used for the distributed computation of the top k eigenvectors of a graph in [18], while in [19] 

averaging is used in a throughput-optimal distributed scheduling algorithm in a wireless network. 

Time model. In a distributed computation, a time model determines when nodes communicate with each other. 

We consider two time models, one synchronous and the other asynchronous, in this paper. The two models are 

described as follows. 

(1) Synchronous time model: Time is slotted commonly across all nodes in the network. In any time slot, each 
node may contact one of its neighbors according to a random choice that is independent of the choices 
made by the other nodes. The simultaneous communication between the nodes satisfies the transmitter gossip 
constraint. 

(2) Asynchronous time model: Each node has a clock that ticks at the times of a rate 1 Poisson process. 
Equivalently, a common clock ticks according to a rate n Poisson process at times Cj~,k > 1, where 
{Cfc + i — Cfc} are i.i.d. exponential random variables of rate n. On clock tick k, one of the n nodes, say Ik, 
is chosen uniformly at random. We consider this global clock tick to be a tick of the clock at node 1^. When 
a node's clock ticks, it contacts one of its neighbors at random. In this model, time is discretized according 
to clock ticks. On average, there are n clock ticks per one unit of absolute time. 

In this paper, we measure the running times of algorithms in absolute time, which is the number of time slots 
in the synchronous model, and is (on average) the number of clock ticks divided by n in the asynchronous model. 
To obtain a precise relationship between clock ticks and absolute time in the asynchronous model, we appeal to 
tail bounds on the probability that the sample mean of i.i.d. exponential random variables is far from its expected 
value. In particular, we make use of the following lemma, which also plays a role in the analysis of the accuracy 
of our algorithm for computing separable functions. 

Lemma 1: For any k > 1, let Y\, . . . , Yf. be i.i.d. exponential random variables with rate A. Let = ^ Yli=i Yi- 
Then, for any e G (0, 1/2), 



Pr 



Rk-j 



>£) < 2exp(-^). 



>- < 2exp --]. (3) 



Proof: By definition, E[Rk] = \ 2~2i=i ^ 1 = ^ The inequality in © follows directly from Cramer's 
Theorem (see [8], pp. 30, 35) and properties of exponential random variables. ■ 

A direct implication of Lemma Q] is the following corollary, which bounds the probability that the absolute time 
Ck at which clock tick k occurs is far from its expected value. 

Corollary 1: For k > 1, E[Ck] = kjn. Further, for any e G (0, 1/2), 

Prf C k - h - ^ - ~ ( ^ 
\ n 

Our algorithm for computing separable functions is randomized, and is not guaranteed to compute the exact 
quantity f(x, V) = YJi=i fi( x i) at eacn node in the network. To study the accuracy of the algorithm's estimates, 
we analyze the probability that the estimate of f(x, V) at every node is within a (1 ± e) multiplicative factor of 
the true value f(x, V) after the algorithm has run for some period of time. In this sense, the error in the estimates 
of the algorithm is relative to the magnitude of f(x, V). 

To measure the amount of time required for an algorithm's estimates to achieve a specified accuracy with a 
specified probability, we define the following quantity. For an algorithm C that estimates f{x, V), let yi(t) be the 
estimate of f(x, V) at node i at time t. Furthermore, for notational convenience, given e > 0, let A^(t) be the 
following event. 

At(t) = {mit) $ [(1 - e)f(x, V), (1 + e)f(x, V)]} 

Definition 2: For any e > and 5 G (0, 1), the (e, 5)-computing time of C, denoted T^ mp (e, 5), is 
T^ mp (e, 5) = sup sup inf {r : Vt > r, Pr ( U™ =1 A\ (t)) < d}. 
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Intuitively, the significance of this definition of the (e, 5) -computing time of an algorithm C is that, if C runs for 
an amount of time that is at least T^ mp (e, 5), then the probability that the estimates of f(x, V) at the nodes are all 
within a (lie) factor of the actual value of the function is at least 1 — 5. 

As noted before, our algorithm for computing separable functions is based on a reduction to the problem of 
information spreading, which is described as follows. Suppose that, for i = 1, . . . , n, node i has the one message 
rrii. The task of information spreading is to disseminate all n messages to all n nodes via a sequence of local 
communications between neighbors in the graph. In any single communication between two nodes, each node 
can transmit to its communication partner any of the messages that it currently holds. We assume that the data 
transmitted in a communication must be a set of messages, and therefore cannot be arbitrary information. 

Consider an information spreading algorithm V, which specifies how nodes communicate. For each node i G V, 
let Si(t) denote the set of nodes that have the message m 8 at time t. While nodes can gain messages during 
communication, we assume that they do not lose messages, so that Si(t{) C Si (£2) if h < £2- Analogous to the 
(e, 5)-computing time, we define a quantity that measures the amount of time required for an information spreading 
algorithm to disseminate all the messages mi to all the nodes in the network. 

Definition 3: For 5 G (0, 1), the 5-information-spreading time of the algorithm V, denoted T^, pr (<5), is 

T%\6) = inf {t : Pr (U« =1 {S,(t) + V}) < 5} . 
In our analysis of the gossip algorithm for information spreading, we assume that when two nodes communicate, 
each node can send all of its messages to the other in a single communication. This rather unrealistic assumption 
of infinite link capacity is merely for convenience, as it provides a simpler analytical characterization of T^ mp (e, 5) 
in terms of T^ r (5). Our algorithm for computing separable functions requires only links of unit capacity. 



A. Our contribution 

The main contribution of this paper is the design of a distributed algorithm to compute separable functions of 
node values in an arbitrary connected network. Our algorithm is randomized, and in particular uses exponential 
random variables. This usage of exponential random variables is analogous to that in an algorithm by CoherQ for 
estimating the sizes of sets in a graph [6]. The basis for our algorithm is the following property of the exponential 
distribution. 

Property 1: Let W±, . . . , W n be n independent random variables such that, for i = 1, . . . , n, the distribution of 
Wi is exponential with rate Aj. Let W be the minimum of W\, . . . , W n . Then, W is distributed as an exponential 
random variable of rate A = Yl7=i 

Proof: For an exponential random variable W with rate A, for any z G R + , 

Pr(W > z) = exp(-A-z). 

Using this fact and the independence of the random variables Wi, we compute Pr(W > z) for any z G R + . 

Vi{W > z) = Pr (n^ =1 {Wi > z}) 

n 

= Y[Pr{Wi>z) 

i=l 



n 

= J^exp(-Aiz) 

i=l 




This establishes the property stated above. ■ 
Our algorithm uses an information spreading algorithm as a subroutine, and as a result its running time is a 
function of the running time of the information spreading algorithm it uses. The faster the information spreading 
algorithm is, the better our algorithm performs. Specifically, the following result provides an upper bound on the 
(e, <5)-computing time of the algorithm. 



'We thank Dahlia Malkhi for pointing this reference out to us. 
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Theorem 1: Given an information spreading algorithm V with (^-spreading time T^ T (5) for 5 G (0, 1), there 
exists an algorithm A for computing separable functions / G T such that, for any e G (0, 1) and S G (0, 1), 

TX P (s,6) = O (e~ 2 (l + log5- 1 )T^(5/2)) . 

Motivated by our interest in decentralized algorithms, we analyze a simple randomized gossip algorithm for 
information spreading. When node i initiates a communication, it contacts each node j ^ i with probability Py. 
With probability Pa, it does not contact another node. The nxn matrix P = \P%j\ characterizes the algorithm; each 
matrix P gives rise to an information spreading algorithm V. We assume that P is stochastic, and that pj = if 
i 7^ j and £ E, as nodes that are not neighbors in the graph cannot communicate with each other. Section 
HVl describes the data transmitted between two nodes when they communicate. 

We obtain an upper bound on the (^-information-spreading time of this gossip algorithm in terms of the conduc- 
tance of the matrix P, which is defined as follows. 

Definition 4: For a stochastic matrix P, the conductance of P, denoted ^(P), is 

3>(Pj = mm 77-7 . 

ScV,0<\S\<n/2 \S\ 

In general, the above definition of conductance is not the same as the classical definition [24]. However, we restrict 
our attention in this paper to doubly stochastic matrices P. When P is doubly stochastic, these two definitions are 
equivalent. Note that the definition of conductance implies that <£(P) < 1. 

Theorem 2: Consider any doubly stochastic matrix P such that if % 7^ j and £ E, then Py = 0. There 
exists an information dissemination algorithm V such that, for any 5 G (0, 1), 



,s P r. f 'logn + log* 1 

(5) = °( ¥(P) 



Note 2: The results of Theorems Q] and [2] hold for both the synchronous and asynchronous time models. Recall 
that the quantities T^ mp (e, 5) and T^ l {5) are defined with respect to absolute time in both models. 
A comparison. Theorems Q] and [2] imply that, given a doubly stochastic matrix P, the time required for our 
algorithm to obtain a (1 ± e) approximation with probability at least 1 — 5 is O — ( 1 + 1 °g' ? ^^ gn+logS — H . When 
the network size n and the accuracy parameters e and 5 are fixed, the running time scales in proportion to 1/$(P), 
a factor that captures the dependence of the algorithm on the matrix P. Our algorithm can be used to compute the 
average of a set of numbers. For iterative averaging algorithms such as the ones in [25] and [5], the convergence 
time largely depends on the mixing time of P, which is lower bounded by 0(1/$(P)) (see [24], for example). 
Thus, our algorithm is (up to a log n factor) no slower than the fastest iterative algorithm based on time-invariant 
linear dynamics. 

III. Function Computation 

In this section, we describe our algorithm for computing the value y = f(x, V) = Y^a=i fi{ x i) °f the separable 
function /, where fi(xi) > 1. For simplicity of notation, let yi = fi{xi). Given Xi, each node can compute yi on 
its own. Next, the nodes use the algorithm shown in Fig. [T] which we refer to as COMP, to compute estimates 
°f V = SILi Vi- T ne q uan tity r is a parameter to be chosen later. 



Algorithm COMP 

0. Initially, for i = 1, . . . , n, node i has the value yi > 1. 

1. Each node i generates r independent random numbers Wf,... ,W£, where the distribution of each W\ is 
exponential with rate y^ (i.e., with mean 

2. Each node i computes, for £ = 1, . . . , r, an estimate W\ of the minimum Wt = min™ =1 Wj. This computation 
can be done using an information spreading algorithm as described below. 

3. Each node i computes y^ = - -*- as its estimate of Y17=i Hi- 



Fig. 1. An algorithm for computing separable functions. 



We describe how the minimum is computed as required by step |2] of the algorithm in Section ITlI-AI The running 
time of the algorithm COMP depends on the running time of the algorithm used to compute the minimum. 
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Now, we show that COMP effectively estimates the function value y when the estimates W\ are all correct by 
providing a lower bound on the conditional probability that the estimates produced by COMP are all within a 
(1 ± e) factor of y. 

Lemma 2: Let yi, . . . ,y n be real numbers (with yi > 1 for i = 1, . . . , ri), y = 2~2i=i Vi> an ^ W = (^i> ■ • • > W r ), 
where the We sue as defined in the algorithm COMP For any node i, let W 1 = (Wf, . . . , W*), and let y» be the 
estimate of y obtained by node i in COMP For any e G (0, 1/2), 

(2 
_e_r 

Proof: Observe that the estimate yi of y at node i is a function of r and W % . Under the hypothesis that W l = W 
for all nodes i G V, all nodes produce the same estimate y = y^ of y. This estimate is y = r (X^?=i ^) > an ^ 
so y- 1 = (ELi^) 

Property Q] implies that each of the n random variables W\ , . . . , W r has an exponential distribution with rate y. 
From Lemma [TJ it follows that for any e G (0, 1/2), 



Pr 



> 



Vi G V, W % = W I <2exp 



e 2 r 



(4) 



This inequality bounds the conditional probability of the event {y 1 €" [(1 — e)y 1 , (l+s)y 1 ]}, which is equivalent 
to the event {y g" [(1 + e^y, (1 - e) _1 y]}. Now, for s G (0, 1/2), 



(1 - e)- 1 G [1 + e, 1 + 2e] , (1 + e)" 1 G [1 — e, 1 — 2e/3] 
Applying the inequahties in ((U) and (f5]), we conclude that for e G (0, 1/2), 



(5) 



Pr l\y - y\ > 2ey | Vi G V, W l 



<2exp( - — 



e 2 r 



Noting that the event U" =1 {|yi — y\ > 2ey} is equivalent to the event {\y — y\> ley] when W l = W for all nodes 
i completes the proof of Lemma |2] ■ 



A. Using information spreading to compute minima 

We now elaborate on step |2] of the algorithm COMP Each node i in the graph starts this step with a vector W % = 
(Wl, ■ ■ ■ , W*), and the nodes seek the vector W = (W\, . . . , W r ), where We = min™ =1 Wl. In the information 
spreading problem, each node i has a message nii, and the nodes are to transmit messages across the links until 
every node has every message. 

If all link capacities are infinite (i.e., in one time unit, a node can send an arbitrary amount of information to 
another node), then an information spreading algorithm V can be used directly to compute the minimum vector 
W. To see this, let the message at the node i be the vector W' L , and then apply the information spreading 
algorithm to disseminate the vectors. Once every node has every message (vector), each node can compute W as 
the component-wise minimum of all the vectors. This implies that the running time of the resulting algorithm for 
computing W is the same as that of the information spreading algorithm. 

The assumption of infinite link capacities allows a node to transmit an arbitrary number of vectors W % to 
a neighbor in one time unit. A simple modification to the information spreading algorithm, however, yields an 
algorithm for computing the minimum vector W using links of capacity r. To this end, each node i maintains a 
single r-dimensional vector w l (t) that evolves in time, starting with w l (0) = W l . 

Suppose that, in the information dissemination algorithm, node j transmits the messages (vectors) W %1 W 1 " 
to node i at time t. Then, in the minimum computation algorithm, j sends to i the r quantities w\, . . . , w r , where 
we = min^ =1 W\ u . The node i sets w\(t + ) = min(w\(t~),we) for I = 1, . . . , r, where t~ and t + denote the times 
immediately before and after, respectively, the communication. At any time t, we will have w l (t) = W for all 
nodes i G V if, in the information spreading algorithm, every node i has all the vectors W 1 , . . . , W n at the same 
time t. In this way, we obtain an algorithm for computing the minimum vector W that uses links of capacity r and 
runs in the same amount of time as the information spreading algorithm. 

An alternative to using links of capacity r in the computation of W is to make the time slot r times larger, 
and impose a unit capacity on all the links. Now, a node transmits the numbers w\ , . . . , w r to its communication 
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partner over a period of r time slots, and as a result the running time of the algorithm for computing W becomes 
greater than the running time of the information spreading algorithm by a factor of r. The preceding discussion, 
combined with the fact that nodes only gain messages as an information spreading algorithm executes, leads to the 
following lemma. 

Lemma 3: Suppose that the COMP algorithm is implemented using an information spreading algorithm V as 
described above. Let W l (t) denote the estimate of W at node i at time t. For any 5 € (0, 1), let t m = rT^{5). 
Then, for any time t > t m , with probability at least 1 — 5, W l (t) = W for all nodes i € V. 

Note that the amount of data communicated between nodes during the algorithm COMP depends on the values 
of the exponential random variables generated by the nodes. Since the nodes compute minima of these variables, 
we are interested in a probabilistic lower bound on the values of these variables (for example, suppose that the 
nodes transmit the values X/W\ when computing the minimum We = 1/ max" =1 {l/H 7 ^}). To this end, we use 
the fact that each We is an exponential random variable with rate y to obtain that, for any constant c > 1, the 
probability that any of the minimum values We is less than 1/B (i.e., any of the inverse values \/W\ is greater 
than B) is at most 5/c, where B is proportional to cry/ 5. 

B. Proof of Theorem [7] 

Now, we are ready to prove Theorem Q] In particular, we will show that the COMP algorithm has the properties 
claimed in Theorem Q] To this end, consider using an information spreading algorithm V with 5-spreading time 
T$ T (5) for 5 £ (0, 1) as the subroutine in the COMP algorithm. For any 5 G (0, 1), let r m = rT^(5/2). By 
Lemma [3j for any time t > r m , the probability that W % ^ W for any node i at time t is at most 5/2. 

On the other hand, suppose that W % = W for all nodes % at time t > r m . For any e € (0, 1), by choosing 
r > I2s~ 2 log(45 _1 ) so that r = 0(e~ 2 (l + logJ" 1 )), we obtain from Lemma |2] that 

Pr(u? =1 {&£[(l- £ )y,(l + e)y]} | V* € V, W l = w) < 6/2. (6) 

Recall that T^q MP (e, 5) is the smallest time r such that, under the algorithm COMP, at any time t > r, all the 
nodes have an estimate of the function value y within a multiplicative factor of (lie) with probability at least 
1 — 5. By a straightforward union bound of events and ©, we conclude that, for any time t > r m , 

Pr(U? = i {yi?[(l-e)y,(l+e)y}})<5. 

For any e £ (0, 1) and 5 G (0, 1), we now have, by the definition of (e, (5)-computing time, 

= 0(e- 2 (l +lo g r 1 )Ti pr (<5/2)). 

This completes the proof of Theorem [T] 

IV. Information spreading 

In this section, we analyze a randomized gossip algorithm for information spreading. The method by which nodes 
choose partners to contact when initiating a communication and the data transmitted during the communication are 
the same for both time models defined in Section ITT] These models differ in the times at which nodes contact each 
other: in the asynchronous model, only one node can start a communication at any time, while in the synchronous 
model all the nodes can communicate in each time slot. 

The information spreading algorithm that we study is presented in Fig. |2l which makes use of the following 
notation. Let Mi{t) denote the set of messages node i has at time t. Initially, Mj(0) = {m 4 } for all i G V. For a 
communication that occurs at time t, let t~ and i + denote the times immediately before and after, respectively, the 
communication occurs. 

As mentioned in Section III-Al the nodes choose communication partners according to the probability distribution 
defined by an n x n matrix P. The matrix P is non-negative and stochastic, and satisfies Pjj = for any pair 
of nodes i ^ j such that E. For each such matrix P, there is an instance of the information spreading 
algorithm, which we refer to as SPREAD(P). 
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Algorithm SPREAD(P) 

When a node i initiates a communication at time t: 

1. Node i chooses a node u at random, and contacts u. The choice of the communication partner u is made 
independently of all other random choices, and the probability that node i chooses any node j is Pij. 

2. Nodes u and i exchange all of their messages, so that 



Fig. 2. A gossip algorithm for information spreading. 

We note that the data transmitted between two communicating nodes in SPREAD conform to the push and pull 
mechanism. That is, when node i contacts node u at time t, both nodes u and i exchange all of their information 
with each other. We also note that the description in the algorithm assumes that the communication links in the 
network have infinite capacity. As discussed in Section IIII-AI however, an information spreading algorithm that 
uses links of infinite capacity can be used to compute minima using links of unit capacity. 

This algorithm is simple, distributed, and satisfies the transmitter gossip constraint. We now present analysis of 
the information spreading time of SPREAD(P) for doubly stochastic matrices P in the two time models. The goal 
of the analysis is to prove Theorem |2 To this end, for any i G F, let Si(t) C V denote the set of nodes that have 
the message rrii after any communication events that occur at absolute time t (communication events occur on a 
global clock tick in the asynchronous time model, and in each time slot in the synchronous time model). At the 
start of the algorithm, 5j(0) = {i}. 

A. Asynchronous model 

As described in Section [TTJ in the asynchronous time model the global clock ticks according to a Poisson process 
of rate n, and on a tick one of the n nodes is chosen uniformly at random. This node initiates a communication, 
so the times at which the communication events occur correspond to the ticks of the clock. On any clock tick, at 
most one pair of nodes can exchange messages by communicating with each other. 

Let k > denote the index of a clock tick. Initially, k = 0, and the corresponding absolute time is 0. For 
simplicity of notation, we identify the time at which a clock tick occurs with its index, so that Si(k) denotes the 
set of nodes that have the message rrii at the end of clock tick k. The following lemma provides a bound on the 
number of clock ticks required for every node to receive every message. 

Lemma 4: For any 6 € (0, 1), define 



Proof: Fix any node v £ V. We study the evolution of the size of the set S v (k). For simplicity of notation, 
we drop the subscript v, and write S(k) to denote S v (k). 

Note that |5(fc)| is monotonically non-decreasing over the course of the algorithm, with the initial condition 
1 5(0) | = 1. For the purpose of analysis, we divide the execution of the algorithm into two phases based on the size 
of the set S(k). In the first phase, |5(&)| < n/2, and in the second phase |5(/s)| > n/2. 

Under the gossip algorithm, after clock tick k + 1, we have either \S(k + 1)| = |5(fc)| or \S(k + 1)| = \S(k)\ + 1. 
Further, the size increases if a node i G S(k) contacts a node j ^ S(k), as in this case i will push the message 
m v to j. For each such pair of nodes i, j, the probability that this occurs on clock tick k + 1 is Pij/n. Since only 
one node is active on each clock tick, 



Mi(t+) = M u {t+) = M t {r) u M u {r). 



K(5) = inf{fc > 



Pr(ur =1 {5,(A;)^y})<5}. 



Then, 






n 



(7) 



ieS(k),jfS(k) 
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When |iS(A;)| < n/2, it follows from (|7]) and the definition of the conductance $>(P) of P that 



E[\S(k + 1)| - \S(k)\ | S(k)} > 



\S(k)\ £ 



i<=S(k),j£S(k) "ij 



> 



n 

\3(k)\ 



\S(k)\ 



mm 



n ScV, 0<|S|<n/2 

\S(k)\ i 



|5| 



n 



L*(P) 



(8) 



where $ 



»(P) 



We seek an upper bound on the duration of the first phase. To this end, let 

exp (jkj 



Z(k) 



\S(k)\ 



Define the stopping time L = ini{k : \S(k)\ > n/2}, and L A k = mm(L,k). If \S(k)\ > n/2, then LA(k + l) = 
L A k, and thus E[Z(L A (k + 1)) | S(L A ft)] = Z(L A k). 

Now, suppose that |S(A;)| < n/2, in which case L A (A; + 1) = (L A A;) + 1. The function 5(2) = 1/z is convex 
for z > 0, which implies that, for zi , 2:2 > 0, 



g{z2) > g(zi) + 3' (21) (22 - zi). 

Applying © with z\ = \S(k + 1)| and z 2 = \S(k)\ yields 

(\S(k + l)\-\S(k)\). 



(9) 



< 



1 



1 



\S(k + l)\~ \S(k)\ \S(k + l)\i 
Since |S(fc + 1)| < \S{k)\ + 1 < 2[5(fc)|, it follows that 



1 



< 



1 



1 



Combining © and £[0]), we obtain that, if |5(A;)| < n/2, then 



(1^ + 1)1-1^)1). 



(10) 



E 



1 



\S{k + l)\ 



S{k) 



< 



\S(k)\ 



exp - — 



as 1 — z < exp(— z) for z > 0. This implies that 
E[Z(L A (A; + 1)) j S(L A k)] 



E 



ex P (§(LA(A; + l)) 



|5(LA(fc + l))| 



= exp ^* (LA fc) j exp ^ 

< exp ( j(L A A;)J exp ^ 
= Z(LAk), 



S{L A jfe) 



L 



exp 



|S((LAfc) + l)| 
/ $\ 1 
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5(L A k) 



\S{LAk)\ 



and therefore Z(L A A;) is a supermartingale. 

Since Z(L A A;) is a supermartingale, we have the inequality E[Z(L A k)] < E[Z(L A 0)] = 1 for any k > 0, as 
Z(L A 0) = Z(0) = 1. The fact that the set S(k) can contain at most the n nodes in the graph implies that 



Z(L A jfe) 



exp f (LA k)\ ! 



|5(L A fc)| 



> — exp 

n 



(LA A) 
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and so 



E 



exp t^(LAk) 



< nE[Z(L A k)] < n. 



Because exp($(L A k)/A) j exp($L/4) as k — > oo, the monotone convergence theorem implies that 



E 



exp 



< n. 



Applying Markov's inequality, we obtain that, for k\ = 4 (In 2 + 2 Inn + In (l/<5) )/<!>, 

' ^ 2n 2 \ 



Pr(L > hi) = Pr exp 



> 



_5_ 

< 2n 



(11) 



For the second phase of the algorithm, when > n/2, we study the evolution of the size of the set of nodes 

that do not have the message, |S'(A;) C |. This quantity will decrease as the message spreads from nodes in S(k) to 
nodes in S(k) c . For simplicity, let us consider restarting the process from clock tick after L (i.e., when more 
than half the nodes in the graph have the message), so that we have |5(0) c | < n/2. 

In clock tick k + 1, a node j G S(k) c will receive the message if it contacts a node i £ S(k) and pulls the 
message from i. As such, 



E[\s(ky\-\s(k + iy\\s(kT]> £ 

ies(fc) c ,i^s , (fc) c 



.21 

n 



and thus 



E[\S(k + l) c | | S(k) c ] < \S(k) c 

= \S(k)< 



n 



YjjeS(k) c ,i<£S(k) c 



n\S(ky 



< \S(k) c \ 1-$ 



(12) 



We note that this inequality holds even when |5'(A;) C | = 0, and as a result it is valid for all clock ticks k in the 
second phase. Repeated application of (fT2~l) yields 



E[\s(ky\] = E[E[\s(ky\\s(k-iy}\ 
< (i - *) E[\s(k - iy\] 



< E[\s(oy\] 

< exp (-i/c) (J 1 

For k 2 = ln(n 2 /5)/2<l = (21nn + ln(l/<5))/$, we have E{\S(k2) c \} < 5/(2n). Markov's inequality now implies 
the following upper bound on the probability that not all of the nodes have the message at the end of clock tick 
&2 in the second phase. 

Pr(|S'(A; 2 ) c | > 0) = Pi(\S(k 2 ) c \ > 1) 

< E[\s(k 2 y\] 
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Combining the analysis of the two phases, we obtain that, for k' = ki+kz = 0((log ra+log 5 1 )/ < &), Pv(S v (k') ^ 
V) < 5/n. Applying the union bound over all the nodes in the graph, and recalling that <3> = $>(P)/n, we conclude 
that 

logn + \og5~ 1 



K{5) < k' = O (n- 



This completes the proof of Lemma |4] ■ 
To extend the bound in Lemma [4] to absolute time, observe that Corollary Q] implies that the probability that 

k = K(S/3) + 271n(3/(5) = 0(n(logn + log £ _1 )/3>(P)) clock ticks do not occur in absolute time (4/3)«/n = 
0((logn + log(5 _1 )/$(P)) is at most 25/3. Applying the union bound now yields T^ pREAD ^(5) = 0((logn + 
log<5 _1 )/<I>(P)), thus establishing the upper bound in Theorem [2] for the asynchronous time model. 



B. Synchronous model 

In the synchronous time model, in each time slot every node contacts a neighbor to exchange messages. Thus, 
n communication events may occur simultaneously. Recall that absolute time is measured in rounds or time slots 
in the synchronous model. 

The analysis of the randomized gossip algorithm for information spreading in the synchronous model is similar 
to the analysis for the asynchronous model. However, we need additional analytical arguments to reach analogous 
conclusions due to the technical challenges presented by multiple simultaneous transmissions. 

In this section, we sketch a proof of the time bound in Theorem^ T^ pREAD ^ (5) = 0((log n+log d^ 1 ) /<3?(P)), 
for the synchronous time model. Since the proof follows a similar structure as the proof of Lemma HI we only 
point out the significant differences. 

As before, we fix a node v € V, and study the evolution of the size of the set S(t) = S v (t). Again, we divide 
the execution of the algorithm into two phases based on the evolution of S(t): in the first phase \S(t)\ < n/2, and 
in the second phase \S(t)\ > n/2. In the first phase, we analyze the increase in while in the second we 

study the decrease in \S(t) c \. For the purpose of analysis, in the first phase we ignore the effect of the increase 
in \S(t)\ due to the pull aspect of protocol: that is, when node i contacts node j, we assume (for the purpose of 
analysis) that i sends the messages it has to j, but that j does not send any messages to i. Clearly, an upper bound 
obtained on the time required for every node to receive every message under this restriction is also an upper bound 
for the actual algorithm. 

Consider a time slot t + 1 in the first phase. For j ^ S{t), let Xj be an indicator random variable that is 1 if 
node j receives the message m v via a push from some node i G S(t) in time slot t + 1, and is otherwise. The 
probability that j does not receive m v via a push is the probability that no node i £ S{t) contacts j, and so 

E[Xj | S(t)] = 1 - Pr(Xj = | S(t)) 

= i - n a - p y ) 

ieS(t) 

> 1- J] exp(-Ptf) 
ieS(t) 

= 1 - exp I - P » ) ■ < 14 ) 

V ies(t) J 

The Taylor series expansion of exp(— z) about z = implies that, if < z < 1, then 

exp(-z) <l-z + z 2 /2 <l-z + z/2 = l- z/2. (15) 
For a doubly stochastic matrix P, we have < YlieSCt) A? — 1> anc ^ so we can com bine (fT4b and ( fT5l ) to obtain 



E[Xj | S{t)] >\Y, P *r 



2 

ieS(t) 
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By linearity of expectation, 

E[\S(t + l)\-\S(t)\\S(t)} = Y, E l x j\S<t)] 



3f£S(t) 

ieS{t)jffS(t) 

\S(t)\ Eiggft), jgg(t) P ij 

2 \S(t)\ 



When \S(t)\ < n/2, we have 



S[|5(t + l)[-|5(t)||5(t)]>|5(t)|^. (16) 

Inequality (TT6l ) is analogous to inequality ([8]) for the asynchronous time model, with $(P)/2 in the place of <£. 
We now proceed as in the proof of Lemma 0] for the asynchronous model. Note that \S(t + 1)[ < 2|5(t)| here in 
the synchronous model because of the restriction in the analysis to only consider the push aspect of the protocol in 
the first phase, as each node in S(t) can push a message to at most one other node in a single time slot. Repeating 
the analysis from the asynchronous model leads to the conclusion that the first phase of the algorithm ends in 
O ( log V(lp)' 5 ~ 1 ) time with probability at least 1 - 6/2n. 

The analysis of the second phase is the same as that presented for the asynchronous time model, with $ replaced 
by <£. As a summary, we obtain that it takes at most O ( log igpsy - — 1 time for the algorithm to spread all the 
messages to all the nodes with probability at least 1 — 5. This completes the proof of Theorem |2] for the synchronous 
time model. 

V. Applications 

We study here the application of our preceding results to several types of graphs. In particular, we consider 
complete graphs, constant-degree expander graphs, and grid graphs. We use grid graphs as an example to compare 
the performance of our algorithm for computing separable functions with that of a known iterative averaging 
algorithm. 

For each of the three classes of graphs mentioned above, we are interested in the <5-information-spreading time 
Tc?p READ , p J5), where P is a doubly stochastic matrix that assigns equal probability to each of the neighbors of 
any node. Specifically, the probability Py that a node i contacts a node j ^ i when i becomes active is 1/A, where 
A is the maximum degree of the graph, and Pa = 1 — ck/A, where di is the degree of i. Recall from Theorem Q] that 
the information dissemination algorithm SPREAD(P) can be used as a subroutine in an algorithm for computing 
separable functions, with the running time of the resulting algorithm being a function of T^ pREAD ^ (5). 

A. Complete graph 

On a complete graph, the transition matrix P has Pa = for i = 1, . . . , n, and Py = l/(n — 1) for j ^ i. 
This regular structure allows us to directly evaluate the conductance of P, which is &(P) ~ 1/2. This implies that 
the (e, <5)-computing time of the algorithm for computing separable functions based on SPREAD(P) is 0(e~ 2 (l + 
log(5 _1 )(logn + log5 _1 )). Thus, for a constant e G (0,1) and 5 = 1/n, the computation time scales as 0(log 2 n). 

B. Expander graph 

Expander graphs have been used for numerous applications, and explicit constructions are known for constant- 
degree expanders [23]. We consider here an undirected graph in which the maximum degree of any vertex, A, is 
a constant. Suppose that the edge expansion of the graph is 

l^,g c )l 
mm — = a, 

SCV, 0<|5|<n/2 |5| 

where F(S,S C ) is the set of edges in the cut (S,S C ), and a > is a constant. The transition matrix P satisfies 
Pij = 1/A for alH ^ j such that £ E, from which we obtain $(P) > a/A. When a and A are constants, 
this leads to a similar conclusion as in the case of the complete graph: for any constant e G (0, 1) and 5 = 1/n, 
the computation time is 0(log 2 n). 
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C. Grid 

We now consider a d-dimensional grid graph on n nodes, where c = n l / d is an integer. Each node in the grid 
can be represented as a d-dimensional vector a = (a^), where ctj E {1, . . . , c} for 1 < i < d. There is one node 
for each distinct vector of this type, and so the total number of nodes in the graph is c d = (n l / d ) d = n. For any 
two nodes a and b, there is an edge (a, b) in the graph if and only if, for some i G {1, . . . , d}, \ai — bi\ = 1, and 
aj = bj for all j ^ i. 

In [1], it is shown that the isoperimetric number of this grid graph is 

\F(S,S C )\ fl\ ( 1 
mm — = B I - I = fc) 



ScV, 0<\S\<n/2 \S\ \C J \n 1 / d 

By the definition of the edge set, the maximum degree of a node in the graph is 2d. This means that Pij = l/(2d) for 
all i / j such that G E, and it follows that <3?(P) = fl (-^rji). Hence, for any e G (0, 1) and 5 G (0, 1), the (e, 
5)-computing time of the algorithm for computing separable functions is 0(e~ 2 (l+log 5~ 1 )(log n+\og5~~ l )dn 1 / d ). 



D. Comparison with Iterative Averaging 

We briefly contrast the performance of our algorithm for computing separable functions with that of the iterative 
averaging algorithms in [25] [5]. As noted earlier, the dependence of the performance of our algorithm is in 
proportion to 1/$(P), which is a lower bound for the iterative algorithms based on a stochastic matrix P. 

In particular, when our algorithm is used to compute the average of a set of numbers (by estimating the sum of 
the numbers and the number of nodes in the graph) on a ^-dimensional grid graph, it follows from the analysis in 
Section IV-CI that the amount of time required to ensure the estimate is within a (1 ± e) factor of the average with 
probability at least 1 - 5 is 0(e~ 2 (l + log <5 _1 )(log n + log 5~ l )dn l l d ) for any e G (0, 1) and 5 E (0, 1). So, for 
a constant e G (0, 1) and 5 = 1/n, the computation time scales as 0(dn l / d log 2 n) with the size of the graph, n. 
The algorithm in [5] requires Vt{n 2 l d log n) time for this computation. Hence, the running time of our algorithm 
is (for fixed d, and up to logarithmic factors) the square root of the runnning time of the iterative algorithm! This 
relationship holds on other graphs for which the spectral gap is proportional to the square of the conductance. 



VI. Conclusions and Future Work 

In this paper, we presented a novel algorithm for computing separable functions in a totally distributed manner. 
The algorithm is based on properties of exponential random variables, and the fact that the minimum of a collection 
of numbers is an order- and duplicate-insensitive statistic. 

Operationally, our algorithm makes use of an information spreading mechanism as a subroutine. This led us 
to the analysis of a randomized gossip mechanism for information spreading. We obtained an upper bound on 
the information spreading time of this algorithm in terms of the conductance of a matrix that characterizes the 
algorithm. 

In addition to computing separable functions, our algorithm improves the computation time for the canonical 
task of averaging. For example, on graphs such as paths, rings, and grids, the performance of our algorithm is of 
a smaller order than that of a known iterative algorithm. 

We believe that our algorithm will lead to the following totally distributed computations: (1) an approximation 
algorithm for convex minimization with linear constraints; and (2) a "packet marking" mechanism in the Internet. 
These areas, in which summation is a key subroutine, will be topics of our future research. 
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